Kolmogorov-Smirnov statistic and its application in library design.

نویسندگان

  • D N Rassokhin
  • D K Agrafiotis
چکیده

After several years of frantic development, the dream of an "ideal" library remains elusive. Traditionally, combinatorial chemistry has been used primarily for lead generation, and molecular diversity has been the method of choice for designing and prioritizing experiments. One aspect that often has been overlooked is the drug likeness of the resulting collections. Recently, there have been several attempts to quantify this concept and incorporate it directly into the design process. This article demonstrates the limitations of some conventional methodologies and proposes a new paradigm for experimental design based on the principles of multiobjective optimization. This method allows traditional design objectives such as diversity or similarity to be combined with secondary selection criteria in order to bias the selection toward more pharmacologically relevant regions of chemical space. The method is robust, general, and easily extensible, and it allows the medicinal chemist to create designs that represent the best compromise between several, often conflicting, objectives. Two types of designs are discussed (singles, arrays), and a novel criterion based on the Kolmogorov-Smirnov statistic is proposed as a means to enforce a particular distribution on key molecular properties that are related to drug likeness. The potential of this approach is illustrated in the design of an exploratory library based on the simultaneous optimization of five different parameters. These parameters are combined in an intuitive manner to produce a design that is sufficiently diverse, exhibits a molecular weight and logP profile that is consistent with the respective distributions of known drugs, requires a small number of reagents, and can be synthesized easily in array format using robotic hardware.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fuzzy Empirical Distribution Function: Properties and Application

The concepts of cumulative distribution function and empirical distribution function are investigated for fuzzy random variables. Some limit theorems related to such functions are established. As an application of the obtained results, a method of handling fuzziness upon the usual method of Kolmogorov–Smirnov one-sample test is proposed. We transact the α-level set of imprecise observations in ...

متن کامل

Empirical Processes , and the Kolmogorov – Smirnov Statistic Math 6070 , Spring 2006

1 Some Basic Theory 1 1.1 Consistency and Unbiasedness at a Point . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 The Kolmogorov–Smirnov Statistic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.3 Order Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.4 Proof of the Kolmogorov–Smirnov Theorem . . ....

متن کامل

Distribution Fitting 2

The methods measuring the departure between observation and the model were reviewed. The following statistics were applied on two experimental data sets: ChiSquared, Kolmogorov-Smirnov, Anderson-Darling, Wilks-Shapiro, and Jarque-Bera. Both investigated sets proved not to be normal distributed. The Grubbs’ test identified one outlier and after its removal the normality of the set of 205 chemica...

متن کامل

A comparison of the discrete Kolmogorov-Smirnov statistic and the Euclidean distance

Goodness-of-fit tests gauge whether a given set of observations is consistent (up to expected random fluctuations) with arising as independent and identically distributed (i.i.d.) draws from a user-specified probability distribution known as the “model.” The standard gauges involve the discrepancy between the model and the empirical distribution of the observed draws. Some measures of discrepan...

متن کامل

Application of the Kolmogorov-Smirnov Test to Estimate the Threshold When Estimating the Extreme Value Index

The Pareto distribution model assumption in the peaks over threshold method, will be tested by making using of the Kolmogorov-Smirnov goodness of fit method. Pareto distributed variables can be transformed to exponential, and the test will be for exponentiality. It was found that the statistic can be used as an indication of where to choose the threshold and to check the Pareto model assumption.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of molecular graphics & modelling

دوره 18 4-5  شماره 

صفحات  -

تاریخ انتشار 2000